Abstract

We characterize the rate-distortion function for zero-mean stationary Gaussian sources under
the MSE fidelity criterion and subject to the additional constraint that the distortion is
uncorrelated to the input. The solution is given by two equations coupled through a single scalar
parameter. This has a structure similar to the well-known water-filling solution obtained without
the uncorrelated distortion restriction. Our results fully characterize the unique statistics of the
optimal distortion. We also show that, for all positive distortions, the minimum achievable rate
subject to the uncorrelation constraint is strictly larger than that given by the unconstrained
rate-distortion function. This gap increases with the distortion and tends to infinity and zero,
respectively, as the distortion tends to infinity and zero.



1 Introduction 



Many lossy source coding schemes have the property that the end-to-end reconstruction error is 
uncorrelated with the source. We refer to such schemes as uncorrelated distortion (UD) coders. 
As an example, consider a typical transform coder, as depicted in Fig. 1. Here, a random vector
$X \in \mathbb{R}^N$ is first transformed by an analysis transform $T \in \mathbb{R}^{N \times N}$ to yield $U = TX$. Then $U$ is
quantized, yielding the vector $\hat{U} = Q(U)$. The input signal is finally approximated by $Y = \tilde{T}\hat{U}$,
where $\tilde{T} \in \mathbb{R}^{N \times N}$ is the synthesis transform, cf. [1,2]. If the quantization error $E = \hat{U} - U$ is
uncorrelated to $U$, and if $\tilde{T}T = I$, then it is easy to show that $Y - X$ is uncorrelated to $X$, thus
yielding a UD coder.

Figure 1: Transform coder.
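The "easy to show" step can be spelled out as a short worked derivation. The sketch below writes $\hat{U}$ for the quantized vector and $\tilde{T}$ for the synthesis transform, and assumes square transforms so that $\tilde{T}T = I$ implies $T$ is invertible:

```latex
\begin{align}
Y - X &= \tilde{T}\hat{U} - X = \tilde{T}(U + E) - X
       = \tilde{T}T X + \tilde{T}E - X = \tilde{T}E, \\
E\!\left[X (Y - X)^T\right] &= E\!\left[X E^T\right]\tilde{T}^T
       = T^{-1} E\!\left[U E^T\right]\tilde{T}^T = 0 .
\end{align}
```

The last equality uses only $E[UE^T] = 0$, so no assumption on the distribution of $E$ is needed beyond being uncorrelated with $U$.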

More generally, any quantization scheme satisfying the following two properties constitutes 
a UD coder: a) The error introduced by the quantizer is uncorrelated to its input; b) The linear 
processing (if any) before and after the quantizer yields perfect reconstruction (PR) in the absence 
of quantization errors. Property a) is satisfied in many cases, e.g. in high-resolution coding [3] or 
when a quantizer with dither (either subtractive [4] or non-subtractive [5]) is employed. On the other 
hand, the PR condition (Property b)) is often imposed (sometimes implicitly) in the design of filter 
banks [6], transform coders [1,2], and feedback quantizers [7, 8]. Thus, any PR source coder using, 
for example, subtractively dithered quantization, is a UD coder. The rate-distortion performance of
any UD coder can be compared to the underlying Shannon's rate-distortion function $R(D)$ of the
source, for a given distortion metric. One may question whether such a comparison is, in fact, fair.
After all, the additional constraint that the end-to-end distortion is uncorrelated with the source is
not imposed upon $R(D)$. With this in mind, let $R^{\perp}(D)$ denote the rate-distortion function with
the additional constraint that the end-to-end distortion is uncorrelated to the source. (A formal
definition of $R^{\perp}(D)$ is given in Section 2.) Clearly, $R^{\perp}(D) \geq R(D)$.¹ However, to the best of the
authors' knowledge, the problem of characterizing $R^{\perp}(D)$ has not been formally addressed before.
Therefore, questions such as in which cases (if any) $R^{\perp}(D)$ equals $R(D)$, and how $R^{\perp}(D)$ can be
achieved, appear to be unanswered.

In this paper, we not only give conclusive answers to the above questions, but more importantly,
we completely characterize $R^{\perp}(D)$ for the quadratic Gaussian case² as a lower bound for the rate
achievable under the uncorrelated distortion constraint.³ We show, in Section 2, that $R^{\perp}(D)$ can be
parameterized through a single scalar variable $\alpha > 0$. This is a result which parallels the conventional
water-filling equations that describe $R(D)$. We characterize the unique optimal statistics that the
reconstruction error $Y - X$ needs to have in order to achieve $R^{\perp}(D)$, for a given Gaussian source $X$.
In particular, we show that $Y - X$ must be Gaussian. In addition, we recast the results in a transform
coding sense. More precisely, we show that if the quantization errors are Gaussian, independent
both mutually and from the source, then the Karhunen-Loeve Transform (KLT) is optimal among all
perfect reconstruction transforms, at all rates.⁴ A comparative analysis between $R^{\perp}(D)$ and $R(D)$
is then presented in Section 3. There we show that $R^{\perp}(D)$ is convex and monotonically decreasing
in $D$, and that $R^{\perp}(D) > R(D)$, $\forall D > 0$, converging in the limit as $R \to \infty$. Furthermore, we show
that $R^{\perp}(D) \to 0 \iff D \to \infty$, which is different from the well-known result $R(D) = 0 \iff D \geq \sigma_X^2$.⁵

It is worth emphasizing that our results are not tied to any particular source coding architecture,
but are general in the sense that any coding scheme in which the end-to-end distortion is uncorrelated
with the source can do no better than $R^{\perp}(D)$.

Notation. We use uppercase letters to represent random vectors, adding a subscript when referring
to one of its elements, i.e., $X_i$ is the $i$-th element of the random vector $X$. The expectation
operator is denoted by $E[\cdot]$. Uppercase bold letters are used for matrices. The positive-definite
square root of a positive-definite matrix $M$ is denoted by $\sqrt{M}$. We write $|M|$ and
$\mathrm{tr}(M)$ for the determinant and the trace of a matrix $M$, respectively. The probability density
function (PDF) and covariance matrix of a random (column) vector $X$ are denoted respectively
by $f_X$ and $K_X \triangleq E[XX^T]$, where $X^T$ is the transpose of $X$. We write $K_{X,Y}$ for the cross-covariance
matrix between two random vectors $X$ and $Y$. The spectrum of a w.s.s. random process
$Z$ with autocorrelation function $R_Z[k] \triangleq E[Z_i Z_{i+k}]$ is denoted by $S_Z(\omega) \triangleq \sum_{k=-\infty}^{\infty} R_Z[k] e^{-jk\omega}$,
$\forall \omega \in [-\pi, \pi]$. The differential entropy and the differential entropy per dimension of an $N$-length
random vector $X$ are denoted, respectively, by $h(X)$ and $\bar{h}(X) \triangleq \frac{1}{N} h(X)$. When $X$ is a random
process, $\bar{h}(X) \triangleq \lim_{N \to \infty} \frac{1}{N} h(X_1, X_2, \ldots, X_N)$ denotes the differential entropy rate of $X$. We
use $I(X;Y)$ and $\bar{I}(X;Y) \triangleq \frac{1}{N} I(X;Y)$ to refer, respectively, to the mutual information and the mutual
information per dimension between two random vectors $X$ and $Y$. When $X$ and $Y$ are random
processes, $\bar{I}(X;Y) \triangleq \lim_{N \to \infty} \frac{1}{N} I(X_1, \ldots, X_N; Y_1, \ldots, Y_N)$ denotes the mutual information
rate between $X$ and $Y$. We write a.e. for "almost everywhere".

1 Enforcing additional constraints can never increase the achievable rate region for a given $D$. Thus, since the achievable
rate region is lower bounded by $R^{\perp}(D)$ we must have $R^{\perp}(D) \geq R(D)$.

2 By quadratic Gaussian we refer to the case of Gaussian sources with the MSE fidelity criterion. Moreover, we restrict
our attention to zero-mean sources.

3 A proof of achievability has recently been derived by the authors in [9].

4 The optimality of a KLT has previously been established by a number of authors in a variety of settings, cf. [1,10,11].
However, this appears to be the first time that this result is proven explicitly for $R^{\perp}(D)$.

5 Notice that, in the case of a vanishingly small positive coding rate, we cannot simply reconstruct the source using its
statistical mean (as we would do in a traditional water-filling solution to $R(D)$) since this will lead to linear distortion,
clearly correlated to $X$ (since it is a linear function of $X$).

2 Rate-Distortion Function with Uncorrelated Distortion 

We begin by formalizing the definition of the quadratic rate-distortion function under the constraint
that the end-to-end distortion be uncorrelated with the source. Then, in Section 2.1, we characterize
this function for Gaussian random vectors, deferring the case of Gaussian stationary processes to
Section 2.2.



Definition 1. The uncorrelated quadratic rate-distortion function $R^{\perp}(D)$ for a random vector
(source) $X \in \mathbb{R}^N$ is defined as

$$R^{\perp}(D) \triangleq \min_{Y:\; E[X(Y-X)^T] = 0,\ \frac{1}{N}\mathrm{tr}(K_{Y-X}) \leq D,\ |K_{Y-X}| > 0} \bar{I}(X;Y), \qquad (1)$$

where $Y$ is an $N$-length random vector.

2.1 $R^{\perp}(D)$ for Gaussian Random Vector Sources

We now present one of the main results of this paper, namely that, for Gaussian vector sources,
$R^{\perp}(D)$ is given by two equations linked through a single scalar parameter. This resembles the
"water-filling" equations that describe $R(D)$. The proof of this result, which is presented in
Theorem 1, makes use of the following lemma.

Lemma 1. Let $X \in \mathbb{R}^N \sim \mathcal{N}(0, K_X)$. Let $Z \in \mathbb{R}^N$ and $Z_G \in \mathbb{R}^N$ be two random vectors
with zero mean and the same covariance matrix, i.e., $K_Z = K_{Z_G}$, and having the same cross-covariance
matrix with respect to $X$, that is, $K_{X,Z} = K_{X,Z_G}$. If $Z_G$ and $X$ are jointly Gaussian,
and if $Z$ has any distribution, then

$$I(X; X + Z) \geq I(X; X + Z_G). \qquad (2)$$

If furthermore $|K_{X+Z}| = |K_{X+Z_G}| > 0$, then equality is achieved in (2) iff $Z \sim \mathcal{N}(0, K_Z)$ with
$Z$ and $X$ being jointly Gaussian.

Proof. Define $Y \triangleq X + Z$ and $Y_G \triangleq X + Z_G$. Then

$$\begin{aligned}
I(X; X + Z) - I(X; X + Z_G) &= h(X|Y_G) - h(X|Y) = h(Z_G|Y_G) - h(Z|Y) \\
&= -\iint f_{Z_G,Y_G}(z,y) \log\big(f_{Z_G|Y_G}(z|y)\big)\, dz\, dy + \iint f_{Z,Y}(z,y) \log\big(f_{Z|Y}(z|y)\big)\, dz\, dy \\
&\overset{(a)}{=} -\iint f_{Z,Y}(z,y) \log\big(f_{Z_G|Y_G}(z|y)\big)\, dz\, dy + \iint f_{Z,Y}(z,y) \log\big(f_{Z|Y}(z|y)\big)\, dz\, dy \\
&= \int f_Y(y) \int f_{Z|Y}(z|y) \log\left(\frac{f_{Z|Y}(z|y)}{f_{Z_G|Y_G}(z|y)}\right) dz\, dy \\
&= \int f_Y(y)\, D\big(f_{Z|Y=y} \,\|\, f_{Z_G|Y_G=y}\big)\, dy \geq 0, \qquad (3)
\end{aligned}$$

where $D(f\|g)$ is the relative entropy (or Kullback-Leibler distance) between the two probability
density functions $f$ and $g$. The equality (a) follows from the fact that $\log(f_{Z_G|Y_G}(z|y))$ is a
quadratic form of $z$ and $y$, and from the fact that $K_{Z,Y} = K_{Z_G,Y_G}$. The inequality follows from the
fact that $D(f\|g) \geq 0$, with equality iff $f = g$. Thus, equality is achieved iff $f_{Z_G|Y_G=y} = f_{Z|Y=y}$
for all $y$ such that $f_Y(y) > 0$. The proof is completed by noting that $|K_{X+Z}| = |K_{X+Z_G}| > 0$
implies $f_Y(y) > 0$ for all $y \in \mathbb{R}^N$. □

Remark 1. We note that the above Lemma generalizes Lemma II.2 in [12], by relaxing the requirement
that $f_{Z|X} = f_Z$ and $f_{Z_G|X} = f_{Z_G}$ to the requirement $K_{X,Z} = K_{X,Z_G}$.

We are now in a position to present the main result of this section: 

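Theorem 1. Let the source $X \in \mathbb{R}^N$ be a zero mean Gaussian random vector with positive-definite
covariance matrix $K_X$ having eigenvalues $\{\lambda_k\}_{k=1}^N$. Then

(i) For any positive $D$,

$$R^{\perp}(D) = \frac{1}{2N} \sum_{k=1}^{N} \log\left(\frac{\big(\sqrt{\lambda_k + \alpha} + \sqrt{\lambda_k}\big)^2}{\alpha}\right), \qquad (4)$$

where the scalar parameter $\alpha \in \mathbb{R}^+$ is such that

$$D = \frac{1}{2N} \sum_{k=1}^{N} \left(\sqrt{\lambda_k^2 + \lambda_k \alpha} - \lambda_k\right), \quad \forall D > 0. \qquad (5)$$

(ii) For each $D > 0$, the value of $\alpha$ that satisfies (5) is unique.

(iii) Let $Y$ satisfy $K_{X,(Y-X)} = 0$, $\mathrm{tr}(K_{Y-X})/N \leq D$ and $|K_{Y-X}| > 0$. Then $\bar{I}(X;Y) = R^{\perp}(D)$
iff $Z \triangleq (Y - X) \sim \mathcal{N}(0, K_{Z^*})$, with

$$K_{Z^*} \triangleq \tfrac{1}{2}\sqrt{K_X^2 + \alpha K_X} - \tfrac{1}{2} K_X, \qquad (6)$$

and where $\alpha$ satisfies (5).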
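To make the parametric description concrete, the following is a minimal numerical sketch (not part of the original derivation) that solves the distortion equation (5) for the scalar parameter by bisection and then evaluates the rate as $\frac{1}{2N}\sum_k \log(1 + \lambda_k/z_k)$, where $z_k = \frac{1}{2}(\sqrt{\lambda_k^2 + \lambda_k\alpha} - \lambda_k)$ are the eigenvalues of the optimal distortion covariance in (6). The function names and the bisection tolerance are illustrative assumptions; rates are in nats per dimension.

```python
import numpy as np

def distortion_from_alpha(eigvals, alpha):
    """Right-hand side of (5): D as a function of the scalar parameter alpha."""
    lam = np.asarray(eigvals, dtype=float)
    return np.mean(np.sqrt(lam**2 + lam * alpha) - lam) / 2.0

def solve_alpha(eigvals, D, tol=1e-12):
    """Find alpha > 0 such that distortion_from_alpha(eigvals, alpha) == D (bisection)."""
    lo, hi = 0.0, 1.0
    while distortion_from_alpha(eigvals, hi) < D:   # grow the bracket until it contains D
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if distortion_from_alpha(eigvals, mid) < D:
            lo = mid
        else:
            hi = mid
        if hi - lo <= tol * hi:
            break
    return 0.5 * (lo + hi)

def rate_ud(eigvals, D):
    """Rate under the uncorrelated-distortion constraint, in nats per dimension."""
    lam = np.asarray(eigvals, dtype=float)
    alpha = solve_alpha(lam, D)
    noise = 0.5 * (np.sqrt(lam**2 + lam * alpha) - lam)   # eigenvalues of K_Z* in (6)
    return 0.5 * np.mean(np.log1p(lam / noise))

# Scalar sanity check: for N = 1 the optimal noise variance equals D,
# so the rate reduces to 0.5 * log(1 + lambda / D).
print(rate_ud([1.0], 1.0), 0.5 * np.log(2.0))
```

Bisection is used here only because the right-hand side of (5) is monotonically increasing in the parameter; any scalar root finder would serve equally well.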

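Proof. Let $\mathcal{U}$ denote the set of all $N$-length random vectors uncorrelated to $X$, and define the sets
$\mathcal{G}_D \subset \mathcal{B}_D \subset \mathcal{U}$ as

$$\mathcal{G}_D \triangleq \{Z \in \mathcal{B}_D : Z \sim \mathcal{N}(0, K_Z)\}; \qquad \mathcal{B}_D \triangleq \{Z \in \mathcal{U} : \mathrm{tr}(K_Z)/N \leq D,\ |K_Z| > 0\}. \qquad (7)$$

With the above definitions, (1) can be written as

$$\begin{aligned}
R^{\perp}(D) &= \min_{Z \in \mathcal{B}_D} \bar{I}(X; X + Z) \overset{(a)}{=} \min_{Z \in \mathcal{G}_D} \bar{I}(X; X + Z) \overset{(b)}{=} \min_{Z \in \mathcal{G}_D} \big\{\bar{h}(X + Z) - \bar{h}(Z)\big\} \\
&= \frac{1}{2N} \min_{Z \in \mathcal{G}_D} \big\{\log|K_X + K_Z| - \log|K_Z|\big\} = \frac{1}{2N} \min_{Z \in \mathcal{G}_D} \big\{\log|K_Z^{-1} K_X + I|\big\}, \qquad (8)
\end{aligned}$$

where (a) follows directly from Lemma 1 and where (b) holds since the definition of $\mathcal{B}_D$ (see (7))
guarantees that both $h(X + Z)$ and $h(Z)$ exist.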

We now prove, by contradiction, that the minimizer of $\log|K_Z^{-1} K_X + I|$ in $\mathcal{G}_D$, namely $Z^*$, is
such that $\mathrm{tr}(K_{Z^*})/N = D$. For this purpose, suppose that $b \triangleq ND/\mathrm{tr}(K_{Z^*}) > 1$, and let $\{\zeta_k\}_{k=1}^N$
be the eigenvalues of $K_{Z^*}^{-1} K_X$. Let $Z' \in \mathcal{G}_D$ be a Gaussian random vector with covariance matrix
$K_{Z'} = b K_{Z^*}$. We then have that $\mathrm{tr}(K_{Z'})/N = D$, and that

$$\log|K_{Z^*}^{-1} K_X + I| = \sum_{k=1}^{N} \log(\zeta_k + 1) > \sum_{k=1}^{N} \log\left(\frac{\zeta_k}{b} + 1\right) = \log|K_{Z'}^{-1} K_X + I|, \qquad (9)$$

since $b > 1$ and because $\log(\cdot)$ is a strictly increasing function. Thus $Z^*$ cannot be a minimizer of
$\log|K_Z^{-1} K_X + I|$ in $\mathcal{G}_D$ unless $\mathrm{tr}(K_{Z^*}) = ND$.

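The minimizer of $\log(|K_X + K_Z| / |K_Z|)$ subject to $\mathrm{tr}(K_Z)/N = D$ can be found using a
variational approach. More precisely, the covariance matrix of the minimizer, $K_{Z^*}$, must necessarily
be such that the derivative of the Lagrangian

$$L(K_Z) \triangleq \log|K_X + K_Z| - \log|K_Z| + \beta\, \mathrm{tr}(K_Z) \qquad (10)$$

with respect to $K_Z$ is zero at $K_Z = K_{Z^*}$, for some $\beta \in \mathbb{R}$, which is equivalent to the condition
that the matrix differential $\partial L(K_Z) = 0$, $\forall\, \partial K_Z$. Using the fact that $\partial \log|M| = \mathrm{tr}(M^{-1} \partial M)$,
for any positive definite matrix $M$, the necessary condition for $Z^*$ to be a minimizer takes the form

$$\begin{aligned}
\partial L(K_Z)\big|_{K_Z = K_{Z^*}} &= \mathrm{tr}\big[(K_X + K_{Z^*})^{-1} \partial K_Z\big] - \mathrm{tr}\big(K_{Z^*}^{-1} \partial K_Z\big) + \beta\, \mathrm{tr}(\partial K_Z) = 0, \quad \forall\, \partial K_Z \\
&\iff \mathrm{tr}\big(\big[(K_X + K_{Z^*})^{-1} - K_{Z^*}^{-1} + \beta I\big] \partial K_Z\big) = 0, \quad \forall\, \partial K_Z \\
&\iff (K_X + K_{Z^*})^{-1} - K_{Z^*}^{-1} + \beta I = 0 \iff K_{Z^*} = \pm\tfrac{1}{2}\sqrt{K_X^2 + \tfrac{4}{\beta} K_X} - \tfrac{1}{2} K_X. \qquad (11)
\end{aligned}$$

The fact that $K_{Z^*}$ needs to be positive definite implies that $\beta > 0$ and that it is infeasible to have a
negative sign before the square root in (11). This, together with the change of variable $\alpha \triangleq 4/\beta$,
leads directly to (6), with $\alpha > 0$. On the other hand, the value of $\alpha$ must be such that the equality
constraint $\mathrm{tr}(K_{Z^*})/N = D$ is satisfied. From (11), and applying Lemma 2 (see appendix), this
requirement is equivalent to

$$D = \frac{1}{N} \mathrm{tr}(K_{Z^*}) = \frac{1}{2N} \sum_{k=1}^{N} \left(\sqrt{\lambda_k^2 + \lambda_k \alpha} - \lambda_k\right),$$

which proves (5).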



Similarly, (4) is obtained by substituting (11) into (8) and then applying Lemma 2, which yields:

$$R^{\perp}(D) = \frac{1}{2N} \log\big|K_{Z^*}^{-1}(K_X + K_{Z^*})\big| = \frac{1}{2N} \sum_{k=1}^{N} \log\left(\frac{\big(\sqrt{\lambda_k + \alpha} + \sqrt{\lambda_k}\big)^2}{\alpha}\right).$$

The uniqueness of $\alpha$ is easily verified by noting that the right hand side of (5) is monotonically
increasing with $\alpha$. Since $\alpha = 4/\beta$ is unique, it follows from (11) that the covariance matrix of
$Z^* = \arg\min_{Z \in \mathcal{B}_D} \bar{I}(X; X + Z)$ is unique⁶, completing the proof. □
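As a numerical cross-check of the variational step, the sketch below builds the optimal distortion covariance from the eigendecomposition of the source covariance (in the spirit of Lemma 2) and evaluates the left-hand side of the stationarity condition $(K_X + K_{Z^*})^{-1} - K_{Z^*}^{-1} + \beta I = 0$ with $\beta = 4/\alpha$. The covariance matrix and the value of the parameter are arbitrary illustrative choices, not taken from the paper.

```python
import numpy as np

def optimal_noise_cov(K_x, alpha):
    """K_Z* = (1/2) sqrt(K_X^2 + alpha K_X) - (1/2) K_X, via the eigendecomposition of K_X."""
    lam, Q = np.linalg.eigh(K_x)                      # K_X = Q diag(lam) Q^T
    z = 0.5 * (np.sqrt(lam**2 + alpha * lam) - lam)   # eigenvalues of K_Z*
    return (Q * z) @ Q.T                              # Q diag(z) Q^T

def stationarity_residual(K_x, alpha):
    """Left-hand side of the matrix equation in (11), using beta = 4 / alpha."""
    K_z = optimal_noise_cov(K_x, alpha)
    beta = 4.0 / alpha
    n = K_x.shape[0]
    return np.linalg.inv(K_x + K_z) - np.linalg.inv(K_z) + beta * np.eye(n)

# Illustrative covariance: an arbitrary positive-definite example.
K_x = np.array([[2.0, 0.8, 0.3],
                [0.8, 1.5, 0.4],
                [0.3, 0.4, 1.0]])
residual = stationarity_residual(K_x, alpha=3.0)
print(np.max(np.abs(residual)))   # expected to be at the level of rounding error
```

Because the optimal distortion covariance is a function of the source covariance, the two matrices share eigenvectors, which is why the scalar quadratic $\beta z^2 + \beta\lambda z - \lambda = 0$ solved per eigenvalue reproduces the matrix solution.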



Transform Coding Realization of $R^{\perp}(D)$: Closer examination of Lemma 2, when used in (6),
suggests that, for a Gaussian source $X$, $R^{\perp}(D)$ can be achieved by the transform coding
architecture shown in Fig. 1. More precisely, an end-to-end distortion having the optimal covariance
matrix $K_{Z^*}$ given by (6) is obtained by choosing the transform $T$ such that $T \Lambda T^T = K_X$, where
$\Lambda \triangleq \mathrm{diag}(\lambda_1, \ldots, \lambda_N)$ (i.e., the KLT transform for $X$), and by having a Gaussian random vector
of quantization errors $E$ with $E[EE^T] = \frac{1}{2}\mathrm{diag}\big(\{\sqrt{\lambda_k^2 + \lambda_k \alpha} - \lambda_k\}_{k=1}^N\big)$.⁷ Interestingly, here
$E[EE^T]$ is not a scaled identity matrix, as is usually the case in KLT transform coding, cf. [13].
This discrepancy arises from the approximation $E[E_k^2] = c\, E[U_k^2]\, 2^{-2 b_k}$, commonly used to link the
variance of $E_k$ to the bit-rate $b_k$ at which each $k$-th transform coefficient is quantized. In this
expression, $c > 0$ is a constant that depends on the PDF of the source and on the type of quantizer.
The well-known optimal bit allocation analyzed, e.g., in [13], is based upon this formula, and thus
minimizes the total bit-rate $r = \frac{1}{2N} \sum_{k=1}^{N} \log_2\big(E[U_k^2]/E[E_k^2]\big) + \frac{1}{2}\log_2(c)$. On the other hand, the
optimal quantization errors implied by Theorem 1 need to be Gaussian, their variances being
such that the end-to-end mutual information $\bar{I}(X;Y) = \frac{1}{2N} \sum_{k=1}^{N} \log_2\big(E[U_k^2]/E[E_k^2] + 1\big)$ is
minimized.⁸ Thus, the difference in the optimal values for $\{E[E_k^2]\}_{k=1}^N$ obtained in each case is due to
the fact that $r \neq \bar{I}(X;Y)$.

6 This is in agreement with the fact that $\log(|K_X + K_Z| / |K_Z|)$ is strictly convex in $K_Z$ for $|K_X| > 0$, as shown
in [12, Lemma II.3], together with the fact that the set $\{K_Z : \mathrm{tr}(K_Z) \leq D,\ |K_Z| > 0\}$ is convex.

7 It is easy to show that these noise variances, namely $\sigma_{E_k}^2$, are such that the derivatives $\partial I(U_k; V_k)/\partial \sigma_{E_k}^2$ are the
same for all $k$.
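The transform coding realization described above can be illustrated numerically. The sketch below is written under stated assumptions: the covariance matrix and the operating point are arbitrary examples, and the orthonormal eigenvector matrix returned by `numpy.linalg.eigh` stands in for the KLT (the choice of which factor plays the analysis role is a convention of this sketch). It computes the per-coefficient noise variances, shows that they are unequal, and evaluates the resulting distortion and rate.

```python
import numpy as np

# Illustrative covariance: an arbitrary positive-definite example.
K_x = np.array([[2.0, 0.8, 0.3],
                [0.8, 1.5, 0.4],
                [0.3, 0.4, 1.0]])
alpha = 3.0                                   # hypothetical operating point

lam, Q = np.linalg.eigh(K_x)                  # K_X = Q diag(lam) Q^T, Q orthonormal
var_e = 0.5 * (np.sqrt(lam**2 + lam * alpha) - lam)   # noise variance per KLT coefficient

K_z = Q @ np.diag(var_e) @ Q.T                # end-to-end distortion covariance
distortion = np.trace(K_z) / len(lam)         # distortion D reached at this alpha
rate_bits = np.sum(np.log2(lam / var_e + 1.0)) / (2 * len(lam))

print("noise variances:", var_e)              # unequal, so not a scaled identity
print("D =", distortion, "rate =", rate_bits, "bits per dimension")
```

The noise variances grow with the corresponding eigenvalue, in contrast to the equal-variance allocation that results from the high-resolution bit-allocation formula.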



2.2 $R^{\perp}(D)$ for Gaussian Stationary Random Processes

The $R^{\perp}(D)$ function defined in (1) can be extended to random processes as follows:

Definition 2. The uncorrelated quadratic rate-distortion function $R^{\perp}(D)$ for a random process $X$
is defined as

$$R^{\perp}(D) \triangleq \min_{Y:\; E[X(Y-X)^T] = 0,\ \lim_{N \to \infty} \frac{1}{N}\mathrm{tr}(K_{Y-X}) \leq D,\ \lim_{N \to \infty} |K_{Y-X}|^{1/N} > 0} \bar{I}(X;Y), \qquad (12)$$

where $Y$ is a random process.

The $R^{\perp}(D)$ function for stationary Gaussian random processes can be derived from the results
obtained in Section 2.1 by restricting to random vectors $X \in \mathbb{R}^N$ having a Toeplitz covariance
matrix, and then letting $N \to \infty$. More precisely, we have the following result:

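Theorem 2. Let the source $X$ be a Gaussian stationary random process with spectrum $S_X(\omega)$ such
that $S_X(\omega) > 0$, a.e. on $[-\pi, \pi]$. Then

(i) For any $D > 0$,

$$R^{\perp}(D) = \frac{1}{4\pi} \int_{-\pi}^{\pi} \log\left(\frac{\big(\sqrt{S_X(\omega) + \alpha} + \sqrt{S_X(\omega)}\big)^2}{\alpha}\right) d\omega, \qquad (13)$$

where the scalar parameter $\alpha \in \mathbb{R}^+$ is such that

$$D = \frac{1}{4\pi} \int_{-\pi}^{\pi} \left(\sqrt{S_X(\omega) + \alpha} - \sqrt{S_X(\omega)}\right) \sqrt{S_X(\omega)}\, d\omega. \qquad (14)$$

(ii) For each $D > 0$, the value of $\alpha$ that satisfies (14) is unique.

(iii) Let $Y$ satisfy $E[X(Y - X)^T] = 0$, $\lim_{N \to \infty} \frac{1}{N}\mathrm{tr}(K_{Y-X}) \leq D$, $\lim_{N \to \infty} |K_{Y-X}|^{1/N} > 0$.
Then $\bar{I}(X;Y) = R^{\perp}(D)$ iff $Z \triangleq Y - X$ is a Gaussian stationary random process with
spectrum

$$S_Z(\omega) = \frac{1}{2}\left(\sqrt{S_X(\omega) + \alpha} - \sqrt{S_X(\omega)}\right) \sqrt{S_X(\omega)}, \quad \text{a.e. on } [-\pi, \pi]. \qquad (15)$$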



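Proof. Define, from the random processes $X$ and $Y$, the vectors $X^{(N)} \triangleq [X_1 \cdots X_N]^T$,
$Y^{(N)} \triangleq [Y_1 \cdots Y_N]^T$, $N \in \mathbb{N}$. It is known that $\lambda_{\min}^{(N)} \geq \lambda_{\min}^{(N+1)}$, $\forall N \in \mathbb{N}$, where $\lambda_{\min}^{(N)}$ and $\lambda_{\min}^{(N+1)}$ are
the smallest eigenvalues of $K_{X^{(N)}}$ and $K_{X^{(N+1)}}$, respectively (see e.g. [14, Theorem 4.3.8]). This
result, together with Lemma 3 in the Appendix, and the fact that $S_X(\omega) > 0$, a.e. on $[-\pi, \pi]$,
implies that $|K_{X^{(N)}}| > 0$, for all $N \in \mathbb{N}$. We can then apply Theorem 1 to each $X^{(N)}$, $\forall N \in \mathbb{N}$,
obtaining

$$R^{\perp}_{X^{(N)}}(D) = \frac{1}{2N} \sum_{k=1}^{N} \log\left(\frac{\big(\sqrt{\lambda_k^{(N)} + \alpha^{(N)}} + \sqrt{\lambda_k^{(N)}}\big)^2}{\alpha^{(N)}}\right), \quad \forall N \in \mathbb{N}, \qquad (16)$$

where $\alpha^{(N)}$ satisfies

$$D = \frac{1}{2N} \sum_{k=1}^{N} \left(\sqrt{\big(\lambda_k^{(N)}\big)^2 + \lambda_k^{(N)} \alpha^{(N)}} - \lambda_k^{(N)}\right), \quad \forall N \in \mathbb{N}, \qquad (17)$$

and where $\{\lambda_k^{(N)}\}_{k=1}^N$ denotes the set of eigenvalues of $K_{X^{(N)}}$. From Theorem 1, the optimal distortion
covariance matrix for $X^{(N)}$ is

$$K_{Z^*}^{(N)} \triangleq \tfrac{1}{2}\sqrt{K_{X^{(N)}}^2 + \alpha^{(N)} K_{X^{(N)}}} - \tfrac{1}{2} K_{X^{(N)}}. \qquad (18)$$

Direct application of Lemma 3 (see the Appendix) to (16) and (17) yields

$$R^{\perp}(D) = \lim_{N \to \infty} R^{\perp}_{X^{(N)}}(D) = \frac{1}{4\pi} \int_{-\pi}^{\pi} \log\left(\frac{\big(\sqrt{S_X(\omega) + \alpha} + \sqrt{S_X(\omega)}\big)^2}{\alpha}\right) d\omega, \qquad (19)$$

wherein $\alpha = \lim_{N \to \infty} \alpha^{(N)}$ is the only scalar that satisfies

$$D = \frac{1}{4\pi} \int_{-\pi}^{\pi} \left(\sqrt{S_X(\omega) + \alpha} - \sqrt{S_X(\omega)}\right) \sqrt{S_X(\omega)}\, d\omega. \qquad (20)$$

Similarly, applying Lemma 3 to (18) we obtain (15). This completes the proof. □

8 The mutual information per dimension between $X$ and $Y$ in this case equals the sum of the mutual informations
between each pair $U_k$, $E_k$, since all these scalars are mutually independent and $T$ is invertible.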
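For a stationary source, the same computation can be carried out on a frequency grid. The sketch below is a numerical approximation under stated assumptions: the integrals over $[-\pi, \pi]$ are replaced by means over a midpoint grid, the rate is evaluated as $\frac{1}{4\pi}\int \log(1 + S_X/S_Z)\, d\omega$ with the distortion spectrum of the form stated in the theorem, and the AR(1) spectrum is only an example source. Function names and the grid size are illustrative.

```python
import numpy as np

def rate_ud_process(spectrum, D, n_grid=4096):
    """Approximate the UD-constrained rate (nats per sample) and the parameter alpha."""
    # Midpoint grid on [-pi, pi]; a grid mean approximates (1 / 2pi) * integral.
    w = -np.pi + 2.0 * np.pi * (np.arange(n_grid) + 0.5) / n_grid
    S = spectrum(w)
    root_S = np.sqrt(S)

    def distortion(alpha):
        # (1 / 4pi) * integral of (sqrt(S + alpha) - sqrt(S)) * sqrt(S)
        return 0.5 * np.mean((np.sqrt(S + alpha) - root_S) * root_S)

    lo, hi = 0.0, 1.0
    while distortion(hi) < D:          # grow the bracket until it contains D
        hi *= 2.0
    for _ in range(200):               # bisection on the monotone map alpha -> D
        mid = 0.5 * (lo + hi)
        if distortion(mid) < D:
            lo = mid
        else:
            hi = mid
    alpha = 0.5 * (lo + hi)

    S_z = 0.5 * (np.sqrt(S + alpha) - root_S) * root_S   # optimal distortion spectrum
    rate = 0.5 * np.mean(np.log1p(S / S_z))              # (1 / 4pi) * integral of log(1 + S_X / S_Z)
    return rate, alpha

def ar1_spectrum(w, rho=0.9):
    """Spectrum of a unit-innovation AR(1) process (example source only)."""
    return 1.0 / (1.0 - 2.0 * rho * np.cos(w) + rho**2)

rate, alpha = rate_ud_process(ar1_spectrum, D=0.5)
print(rate, alpha)
```

For a white spectrum the result collapses to the scalar case, which gives a convenient sanity check of the grid approximation.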



Remark 2. It is interesting to note that the equations characterizing the optimal UD feedback
converters derived in [8] achieve an end-to-end distortion whose spectrum is given precisely by (15).
Furthermore, it is easy to show that such converters would achieve the $R^{\perp}(D)$ function if the noise
due to the scalar quantization within the feedback loop were white Gaussian noise uncorrelated
with the input.



3 Comparison with R(D) 

The next theorem shows that $R^{\perp}(D)$ shares strict monotonicity and convexity with $R(D)$, but
deviates from $R(D)$ in the asymptotic limit of large distortions.

Theorem 3. For any Gaussian random vector (stationary random process) $X$ with positive definite
covariance matrix $K_X$ (with $S_X(\omega) > 0$, a.e. on $[-\pi, \pi]$), the function $R^{\perp}(D)$ is monotonically
decreasing and convex. In addition, $D \to \infty \iff R^{\perp} \to 0$, and $D \to 0 \iff R^{\perp} \to \infty$.

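Proof. We present here only the proof for the case of Gaussian random vectors. The proof for
Gaussian stationary processes proceeds in an analogous fashion. Monotonicity: We have that
$\frac{\partial R^{\perp}}{\partial D} = \frac{\partial R^{\perp}/\partial \alpha}{\partial D/\partial \alpha}$, provided that $\frac{\partial R^{\perp}}{\partial \alpha}$ and $\frac{\partial D}{\partial \alpha}$ exist and that the latter derivative is non-zero.
From (4), we obtain

$$\frac{\partial R^{\perp}}{\partial \alpha} = \frac{1}{2N} \sum_{k=1}^{N} \frac{\partial}{\partial \alpha}\left[2\log\big(\sqrt{\lambda_k + \alpha} + \sqrt{\lambda_k}\big) - \log(\alpha)\right] = \frac{1}{2N} \sum_{k=1}^{N} \left[\frac{\sqrt{\lambda_k + \alpha} - \sqrt{\lambda_k}}{\alpha \sqrt{\lambda_k + \alpha}} - \frac{1}{\alpha}\right] = -\frac{1}{2N\alpha} \sum_{k=1}^{N} \frac{\sqrt{\lambda_k}}{\sqrt{\lambda_k + \alpha}}. \qquad (21)$$

On the other hand, from (5),

$$\frac{\partial D}{\partial \alpha} = \frac{1}{2N} \sum_{k=1}^{N} \frac{\partial}{\partial \alpha}\left(\sqrt{\lambda_k^2 + \lambda_k \alpha} - \lambda_k\right) = \frac{1}{4N} \sum_{k=1}^{N} \frac{\sqrt{\lambda_k}}{\sqrt{\lambda_k + \alpha}},$$

and thus

$$\frac{\partial R^{\perp}(D)}{\partial D} = \frac{-\frac{1}{2N\alpha} \sum_{k=1}^{N} \frac{\sqrt{\lambda_k}}{\sqrt{\lambda_k + \alpha}}}{\frac{1}{4N} \sum_{k=1}^{N} \frac{\sqrt{\lambda_k}}{\sqrt{\lambda_k + \alpha}}} = -\frac{2}{\alpha}, \quad \forall D > 0, \qquad (22)$$

proving that $R^{\perp}(\cdot)$ is a strictly decreasing function (since $\alpha > 0$). Convexity: The fact that $\alpha$ grows
monotonically with increasing $D$, together with (22), imply that
$\frac{\partial R^{\perp}}{\partial D}\big|_{D = D_1} \geq \frac{\partial R^{\perp}}{\partial D}\big|_{D = D_2} \iff D_1 \geq D_2$, and thus $R^{\perp}(\cdot)$ is convex. Limits: It is clear from (4) that
$\lim_{\alpha \to \infty} R^{\perp} = 0$ and $\lim_{\alpha \to 0} R^{\perp} = \infty$. Since, as can be seen from (21), $R^{\perp}$ decreases monotonically with increasing
$\alpha$, $\forall \alpha \in (0, \infty)$, it follows that $R^{\perp} \to 0 \iff \alpha \to \infty$, and that $R^{\perp} \to \infty \iff \alpha \to 0$.
Similarly, it follows from (5) and the monotonicity of $D$ with respect to $\alpha$ that, for fixed $\{\lambda_k\}_{k=1}^N$,
$D \to \infty \iff \alpha \to \infty$ and $D \to 0 \iff \alpha \to 0$. We then have that $D \to \infty \iff R^{\perp} \to 0$ and
$D \to 0 \iff R^{\perp} \to \infty$, completing the proof. □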
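The slope obtained in the monotonicity argument, namely that the derivative of the rate with respect to the distortion equals $-2/\alpha$ (in nats), lends itself to a quick finite-difference check along the parametric curve. In the sketch below the rate is evaluated through the closed form $\frac{1}{2N}\sum_k \log\big((\sqrt{\lambda_k+\alpha}+\sqrt{\lambda_k})^2/\alpha\big)$, which is algebraically equal to $\frac{1}{2N}\sum_k \log(1+\lambda_k/z_k)$ for the optimal noise eigenvalues $z_k$; the eigenvalues, step size, and function names are illustrative assumptions.

```python
import numpy as np

def curve_point(eigvals, alpha):
    """Return (D, R) on the parametric curve for a given alpha (R in nats per dimension)."""
    lam = np.asarray(eigvals, dtype=float)
    D = np.mean(np.sqrt(lam**2 + lam * alpha) - lam) / 2.0
    R = np.mean(np.log((np.sqrt(lam + alpha) + np.sqrt(lam))**2 / alpha)) / 2.0
    return D, R

def slope(eigvals, alpha, h=1e-4):
    """Central finite-difference estimate of dR/dD at the point indexed by alpha."""
    D_lo, R_lo = curve_point(eigvals, alpha - h)
    D_hi, R_hi = curve_point(eigvals, alpha + h)
    return (R_hi - R_lo) / (D_hi - D_lo)

eig = [4.0, 1.0, 0.25]               # example eigenvalues
for a in (0.5, 2.0, 8.0):
    print(a, slope(eig, a), -2.0 / a)
```

Since the slope depends on the source only through the parameter, convexity follows immediately once the parameter is known to increase with the distortion.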



We next show that $R^{\perp}(D) > R(D)$ for all $D > 0$, converging asymptotically as $D \to 0$.

Theorem 4. For any Gaussian random vector (stationary random process) $X$ with positive definite
covariance matrix $K_X$ (with $S_X(\omega) > 0$, a.e. on $[-\pi, \pi]$), the following holds

$$\text{(i)}\ \ R^{\perp}(D) - R(D) > 0,\ \forall D > 0; \qquad \text{(ii)}\ \ \lim_{D \to 0} \big[R^{\perp}(D) - R(D)\big] = 0. \qquad (23)$$



Proof. We present here only the proof for the case of Gaussian random vectors. The proof for
Gaussian stationary processes proceeds in an analogous fashion. Recall that for a Gaussian random
vector $X$ having positive covariance matrix with eigenvalues $\{\lambda_k\}_{k=1}^N$ one has that
$R(D) \geq \frac{1}{2N} \sum_{k=1}^{N} \log(\lambda_k / D)$, with equality iff $D \leq \min_k\{\lambda_k\}_{k=1}^N$. As a consequence,

$$R^{\perp}(D) - R(D) \leq R^{\perp}(D) - \frac{1}{2N} \sum_{k=1}^{N} \log(\lambda_k / D) \qquad (24)$$

$$\overset{(a)}{=} \frac{1}{2N} \sum_{k=1}^{N} \log\left[\left(\frac{\sqrt{\lambda_k + \alpha} + \sqrt{\lambda_k}}{\sqrt{\lambda_k}}\right)^2 \frac{1}{2N} \sum_{i=1}^{N} \frac{\lambda_i}{\sqrt{\lambda_i^2 + \lambda_i \alpha} + \lambda_i}\right]. \qquad (25)$$

Equality (a) is obtained by substituting (4) and (5) into the right hand side of (24). The validity
of (23)-(ii) then follows directly by taking the limit of the right hand side of (25) as $\alpha \to 0$. In
order to prove (23)-(i), we will show that $D^{\perp}(R) > D(R)$, where the function $D^{\perp}(\cdot)$ is the inverse
of $R^{\perp}(\cdot)$ and $D(R)$ is Shannon's distortion-rate function. For this purpose, consider the random
vector $Y' \triangleq W(X + Z^*)$, where $W \in \mathbb{R}^{N \times N}$ is a symmetric, positive-definite matrix, and $Z^*$ is
as in Theorem 1-(iii). Notice that $Y' - X$ and $X$ are not uncorrelated unless $W = I$. Writing
$Y \triangleq X + Z^*$, the mutual information per dimension between $Y'$ and $X$ is given by
$\bar{I}(X; Y') = \bar{h}(WY) - \bar{h}(WY|X) = \bar{h}(WY) - \bar{h}(WX + WZ^*|X) = \bar{h}(WY) - \bar{h}(WZ^*) = \bar{h}(Y) - \bar{h}(Z^*) = \bar{I}(X; Y)$.
Thus, for any positive definite $W$, $\bar{I}(X; Y') = R^{\perp}(D)$. We next show that for any $D$ (and corresponding $K_{Z^*}$),
choosing an optimal matrix $W$ yields a $Y'$ whose distortion $\frac{1}{N}\mathrm{tr}(K_{Y'-X})$ is strictly smaller than
$D$. It is easy to show that $\mathrm{tr}(K_{Y'-X})$ is minimized by choosing $W = W^*$, where
$W^* \triangleq (K_X + K_{Z^*})^{-1} K_X$ is the Wiener filter (matrix) for $X + Z^*$. From this equation, we define the function
$D'(R) \triangleq \frac{1}{N}\mathrm{tr}(K_{W^*(X + Z^*) - X})$, describing the distortion associated with $Y' = W^*(X + Z^*)$,
with the covariance matrix of $Z^*$ as in (6) when $R^{\perp} = R$. Since $D(R) \leq D'(R)$, we obtain, from
applying Lemma 2 and after some algebraic manipulation, that

$$\frac{D^{\perp}(R)}{D(R)} \geq \frac{D^{\perp}(R)}{D'(R)} = \frac{\mathrm{tr}[K_{Z^*}]}{\mathrm{tr}[(K_X + K_{Z^*})^{-1} K_{Z^*} K_X]} = \frac{\alpha}{2} \cdot \frac{\sum_{k=1}^{N} \left(\sqrt{\lambda_k^2 + \lambda_k \alpha} - \lambda_k\right)}{\sum_{k=1}^{N} \left(\sqrt{\lambda_k^2 + \lambda_k \alpha} - \lambda_k\right)^2} > 1, \qquad (26)$$

where the last inequality follows from the fact that $\alpha/2 > \sqrt{\lambda_k^2 + \lambda_k \alpha} - \lambda_k$, $\forall \alpha > 0$. Finally,
the fact that both $R(D)$ and $R^{\perp}(D)$ are monotonically decreasing functions, together with (26),
implies (23)-(i). This completes the proof. □
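The two claims of the theorem can be observed numerically by comparing the UD-constrained rate against the classical reverse water-filling solution. The sketch below is an illustration under stated assumptions: reverse water-filling is implemented in its standard textbook form, the eigenvalues are an arbitrary example, and all function names are hypothetical. Rates are in nats per dimension.

```python
import numpy as np

def _bisect(func, target, lo, hi, iters=200):
    """Solve func(x) = target for a non-decreasing func on [lo, hi]."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if func(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def rate_shannon(eigvals, D):
    """Classical R(D) for a Gaussian vector via reverse water-filling."""
    lam = np.asarray(eigvals, dtype=float)
    if D >= lam.mean():
        return 0.0
    theta = _bisect(lambda t: np.mean(np.minimum(t, lam)), D, 0.0, lam.max())
    return 0.5 * np.mean(np.log(lam / np.minimum(theta, lam)))

def rate_ud(eigvals, D):
    """Rate under the uncorrelated-distortion constraint."""
    lam = np.asarray(eigvals, dtype=float)

    def noise(alpha):
        # Numerically stable form of 0.5 * (sqrt(lam^2 + lam * alpha) - lam).
        return 0.5 * lam * alpha / (np.sqrt(lam**2 + lam * alpha) + lam)

    hi = 1.0
    while np.mean(noise(hi)) < D:
        hi *= 2.0
    alpha = _bisect(lambda a: np.mean(noise(a)), D, 0.0, hi)
    return 0.5 * np.mean(np.log1p(lam / noise(alpha)))

eig = [4.0, 1.0, 0.25]               # example eigenvalues
for D in (1e-4, 1e-2, 0.1, 0.5, 1.0, 2.0):
    print(D, rate_ud(eig, D) - rate_shannon(eig, D))
```

The printed rate gap is positive for every distortion and shrinks as the distortion decreases, in line with (i) and (ii).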

The summation term in (25) describes exactly the rate loss $R_L(D) \triangleq R^{\perp}(D) - R(D)$ for all
$D \leq \min_k\{\lambda_k\}_{k=1}^N$. For $D$ within this range, it can be shown, using Chebyshev's sum inequality,
that $\partial R_L(D)/\partial \alpha > 0$, which implies that $R_L(D)$ increases monotonically with increasing $D$. On
the other hand, for all $D > 0$, the ratio $D^{\perp}(R)/D(R)$ can be lower bounded by (26). It can be
shown that this bound increases with $\alpha$ (and thus with $D$ as well), tending to $\infty$ as $\alpha \to \infty$, which
is in agreement with Theorem 3.

4 Concluding Remarks 

In this work we have completely characterized $R^{\perp}(D)$, the quadratic Gaussian rate-distortion
function subject to the constraint that the end-to-end distortion be uncorrelated with the source. We have
further proved that this function shares convexity and monotonicity with Shannon's rate-distortion
function $R(D)$, but $R^{\perp}(D)$ is positively bounded away from the latter, converging to $R(D)$ only
in the limit as the distortion tends to zero. We showed that the uncorrelation constraint causes the
distortion to grow unboundedly as the rate tends to zero. We also discussed the achievability of
$R^{\perp}(D)$ for random vectors and stationary random processes through transform coding and feedback
quantization architectures.

5 Appendix 

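Lemma 2 (Adapted from Corollary 11.1.2 in [15]). Let

$$A = Q\, \mathrm{diag}(\lambda_1, \ldots, \lambda_N)\, Q^{-1} = \sum_{k=1}^{N} \lambda_k\, Q_{:,k}\, Q^{-1}_{k,:},$$

with $Q \in \mathbb{C}^{N \times N}$, where $Q_{:,k}$ and $Q^{-1}_{k,:}$ denote the $k$-th column and the $k$-th row of $Q$ and
$Q^{-1}$, respectively. If $f(\cdot)$ is analytic in a neighbourhood around each $\lambda_k$, for $k = 1, \ldots, N$, then

$$f(A) = Q\, \mathrm{diag}\big(f(\lambda_1), \ldots, f(\lambda_N)\big)\, Q^{-1} = \sum_{k=1}^{N} f(\lambda_k)\, Q_{:,k}\, Q^{-1}_{k,:}. \qquad (27)$$

Lemma 3 (Theorem 4.5.2 in [16]). Let $A^{(\infty)}$ be an infinite Toeplitz matrix with entry $a_k \in \mathbb{R}$
on the $k$-th diagonal. Then the eigenvalues of $A^{(\infty)}$ are contained in the interval $m \leq \lambda \leq M$,
where $m$ and $M$ denote the essential infimum and supremum, respectively, of the function
$f(\omega) \triangleq \sum_{k=-\infty}^{\infty} a_k e^{-jk\omega}$. Moreover, if both $m$ and $M$ are finite and $G(\lambda)$ is any continuous function of
$\lambda \in [m, M]$, then

$$\lim_{N \to \infty} \frac{1}{N} \sum_{k=1}^{N} G\big(\lambda_k^{(N)}\big) = \frac{1}{2\pi} \int_{-\pi}^{\pi} G[f(\omega)]\, d\omega, \qquad (28)$$

where the $\lambda_k^{(N)}$ are the eigenvalues of the sub-matrix $A^{(N)} \in \mathbb{R}^{N \times N}$ of $A^{(\infty)}$ centered about the
main diagonal of $A^{(\infty)}$.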